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Abstract. Let T be an unrooted tree. The chromatic symmetric function 
Xy, introduced by Stanley, is a sum of monomial symmetric functions corre- 
sponding to proper colorings of T. The subtree polynomial St, first considered 
under a different name by Chaudhary and Gordon, is the bivariatc generating 
function for subtrees of T by their numbers of edges and leaves. We prove that 
S T = Xy), where (-,-) is the Hall inner product on symmetric functions 
and $ is a certain symmetric function that does not depend on T. Thus the 
chromatic symmetric function is a stronger isomorphism invariant than the 
subtree polynomial. As a corollary, the path and degree sequences of a tree 
can be obtained from its chromatic symmetric function. As another applica- 
tion, we exhibit two infinite families of trees (spiders and some caterpillars), 
and one family of unicyclic graphs (squids) whose members are determined 
completely by their chromatic symmetric functions. 



Let G be a simple graph with vertices V(G) and edges E(G), and let P denote 
the positive integers. A (proper) coloring of G is a function k : V(G) — > P such 
that k(v) k(w) whenever the vertices v, w are adjacent. Stanley ([16] ; see also 
[17[ pp. 462-464]) defined the chromatic symmetric function of G as 



the sum over all colorings k, where xi,x^. .. are countably infinitely many com- 
muting indeterminates. This definition is invariant under permutations of {xi}, so 
Xg is a symmetric function, homogeneous of degree n = #V(G), 

The chromatic symmetric function is a much stronger isomorphism invariant 
than the well-known chromatic polynomial xc(k), a polynomial function of k that 
gives the number of colorings of G using at most k colors. Indeed, for any integer k, 
the number xc(k) can be obtained from Xg by setting x\ = • • • = xu = 1 and 
Xi = for all i > k. 

It is natural to ask whether is a complete isomorphism invariant; that is, 
whether two non-isomorphic graphs must have different chromatic symmetric func- 
tions. The answer is no; the smallest example, shown in Figure [IJ was given by 
Stanley in [IB] . Brylawski [2] introduced a graph invariant called the polychro- 
mate and constructed an infinite family of pairs of nonisomorphic graphs with the 
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XI 

Figure 1. Stanley's example [16] of two non-isomorphic graphs 
with the same chromatic symmetric function. 

same polychromatc. Sarmiento [15] proved that the polychromate is equivalent to 
the U -polynomial studied by Noble and Welsh [13j . a stronger invariant than Xq] 
therefore, each pair in Brylawski's construction shares the same chromatic sym- 
metric functions. (We thank Anna de Mier for directing our attention to these 
results.) In another direction, the non-commutative version of the chromatic sym- 
metric function, studied by Gebhard and Sagan [7j, is easily seen to be a complete 
invariant. 

Stanley's question of whether is a complete isomorphism invariant for trees 
remains open. This is equivalent to the problem of whether a tree is determined 
by its {/-polynomial, since the formula for Xq in terms of the [/-polynomial |13[ 
Theorem 6.1] is easily seen to be reversible for trees. Stanley's question was an- 
swered in the affirmative for certain special kinds of trees by Fougere [5] and the 
second author [12], both of whom listed several other tree invariants that can be 
extracted from the chromatic symmetric function. Additionally, Tan [18] has ver- 
ified computationally that the answer is "yes" for trees with 23 or fewer vertices. 
(In contrast, the chromatic polynomial is nearly useless for distinguishing trees, 
because xt(^) = k(k — l) n_1 for every tree T with n vertices.) 

Our main tool is Stanley's expansion of the chromatic symmetric function in 
the basis of power-sum symmetric functions p\ [161 Theorem 2.5]; see equation ([5]) 
below. When T is a (possibly trivial) tree, the coefficient C\(T) of p\ in Xt has a 
particularly simple combinatorial interpretation. For A C E(T), define the type of 
A to be the partition whose parts are the sizes of the vertex sets of the graph with 
vertices V(T) and edges A (see Figure[3]for an example). Then, up to a sign, c\(T) 
is the number of edge sets A of type A. As we will see, many other invariants of T 
can be recovered from Xj. 

Recall that the degree of a vertex is the number of edges incident to it. A leaf 
of a tree is a vertex of degree 1 , and the unique incident edge is called a leaf edge. 
We define the subtree polynomial of T by 

S T = S T (q,r)= q* S r* L{S \ 

subtrees S 

where the sum runs over all subtrees S of T with at least one edge, and L(S) 
denotes the set of leaf edges of S. Setting q = t(z + 1) and r = l/(z + 1) in 
St recovers the polynomial /s(T;t, z) studied by Chaudhary and Gordon in [3] 
Section 3]. Conversely, /e(T; gr, (1 — r)/r) = Sr(q,r), so the two polynomials 
provide identical information about T. 

For every non-empty set A C E(T), there is a unique minimal subset K(A) C 
E(T) - A, called the connector of A, such that A U K{A) is a tree. (So K(A) = 
if and only if A is itself a subtree of T.) The connector polynomial of T is then 
defined as 

K T = K T (x,y)= ^* A y* K(A) - 

0#ACB(T) 
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Figure 2. Two trees with the same subtree polynomial but dif- 
ferent chromatic symmetric functions. 

The polynomials St and provide equivalent information about T; we will 
prove in Proposition [4] below that each of these invariants can be obtained from the 
other. Moreover, the path sequence and degree sequence of T can easily be recovered 
from St (q, r), as observed by Chaudhary and Gordon [31 Proposition 18]. 

For a partition A = (Ai > A2 > • • • > A^) h n, and integers a, b, i, j, define 

We can now state our main theorem, which asserts that the subtree and connector 
polynomials can be recovered from the chromatic symmetric function Xy. 

Theorem 1. For every n > 1, and for every tree T with n vertices, 

K T (x,y) = ]T]>>V]>>( A ' a < 5 ) c ^ r ) (3) 

a>0 6>0 Ahn 

and 

n— 1 i 

S T (q,r) = 5^5V5>(A,t,j)c A (T). (4) 

i— 1 j — 1 Ahn 

It follows that the chromatic symmetric function is at least as strong an invariant 
as the subtree and connector polynomials. In particular, the path and degree 
sequences of T can be recovered from Xy, as announced previously in |llj ; this 
generalizes a earlier result of Fougere Theorem 3.3.1]. Section [5] contains the 
proof of Theorem [U as well as explicit formulas for the path and degree sequences, 
and a reinterpretation of (J3J) and (j4|) in terms of the usual scalar product on the 
space of symmetric functions. 

Theorem [T] implies that Xt is a stronger invariant than St- In fact, it is strictly 
stronger: the two trees shown in Figure [5] have different chromatic symmetric func- 
tions, but the same subtree polynomial. (Eisenstat and Gordon [4] constructed an 
infinite family of pairs of non-isomorphic trees with the same subtree polynomials, 
of which Figure [2] is the smallest example.) Thus Stanley's question remains open. 

As another application of the combinatorial interpretation of the coefficients 
c\(T), we identify some classes of trees for which the chromatic symmetric function 
is in fact a complete invariant. These trees include all spiders (trees having exactly 
one vertex of degree > 3) and some caterpillars (trees from which deleting all leaves 
yields a path, such as those in Figure [2]). We prove in Section [3] that every spider 
can be reconstructed from its subtree polynomial, hence from its chromatic sym- 
metric function (generalizing results of Fougere [6]). The corresponding problem 
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for caterpillars is more difficult; however, certain special kinds of caterpillars can in- 
deed be reconstructed from their chromatic symmetric functions, and the methods 
we use to prove this may be extendible to all caterpillars. 

A unicyclic graph is a graph with one cycle. Connected unicyclic graphs can be 
recognized as such from their chromatic symmetric functions. While the combina- 
torial data provided by Stanley's expansion (|5|) is not as fine for unicyclic graphs 
as it is for trees, we can still obtain some uniqueness results for special unicyclic 
graphs by mimicking our results for spiders and caterpillars. In particular, we show 
in Section [5] that no two squids (unicyclic graphs with at most one vertex of degree 
two or more) can have the same chromatic symmetric function, although it is not 
clear whether membership in the class of squids can be determined from Xg. An 
analogous result holds for crabs (unicyclic graphs in which every vertex not on the 
cycle is a leaf) satisfying an additional technical condition. 

Acknowledgements. Our collaboration began at the Graduate Student Com- 
binatorics Conference held at the University of Minnesota on April 16 and 17, 
2005. We thank the organizers of the conference for their efforts, and we thank 
Francois Bergeron, Tom Enkosky, Gary Gordon, Brandon Humpcrt, Rosa Orellana, 
Victor Reiner, Bruce Sagan, and Stephanie van Willigenburg for many helpful con- 
versations. The database of trees constructed by Piec, Malarz, and Kulakowski 
[14j . and John Stembridge's freely available Maple package SF, were invaluable for 
calculating examples and formulating (and checking small cases of) Conjectures [6] 
and [71 Finally, we thank two anonymous referees for their suggestions, and in 
particular for making us aware of Fougere's undergraduate thesis [B]. 

1. Background 

We assume that the reader is familiar with basic facts about graphs and trees 
(see, e.g., [TJ Chapter I]). We denote a graph G by an ordered pair (V 7 E), where 
V = V(G) is the set of vertices and E = E{G) is the set of edges. All our graphs 
are simple] that is, we forbid loops and parallel edges. The order of a graph is its 
number of vertices. A tree is a graph G which is acyclic and connected and for 
which #V{G) — #E(G) + 1; any two of these conditions together imply the third. 
We consider the graph with one vertex and no edges to be a tree, the trivial tree; 
unless otherwise specified, all our statements about trees include this possibility. A 
leaf of a tree is a vertex of degree 1, that is, with exactly one incident edge. Every 
nontrivial tree has at least two leaves [Hp. 11]. It is often notationally convenient 
to ignore the distinction between a graph and its edge set. 

We now review some facts about symmetric functions (for which the standard 
references are |10) and [171 Chapter 7] ) and about the chromatic symmetric function 
(introduced by Stanley in [P6]1. 

A partition is a sequence A = (Ai , . . . , A^) of positive integers in weakly decreasing 
order. The numbers A& are called the parts of A. We say that A is a partition of n, 
written A h n, if Afc = n. The number I = £(X) is called the length of A. 

Let xi,X2,--- be a countably infinite set of commuting indeterminates. For 
k G P, the k th power-sum symmetric function is 

i>l 
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Figure 3. An edge set of type (3, 3, 2, 1, 1). 



and for a partition A we define 

£(X) 

p\ = Y[p\ k 

k=l 

It is well known that {p\ | A h n} is a basis for the Q-vector space A„ consisting of 
all symmetric functions that are homogeneous of degree n. 

Let G be a graph of order n. Stanley [16j Thm. 2.5] proved that 

X G = (-l) # ^type(A), (5) 

ACE(G) 

where type(A) is the partition whose parts are the orders of the connected com- 
ponents of the subgraph of G induced by A (see Figure [3] for an example) . Note 
that type(A) depends upon A and V(G), but not on E(G). We write c\(G) for the 
coefficient of p\ in the expansion (J5]); that is, 

X G = 5> A (G)p A . (6) 

We will abbreviate c\ = c\(G) when no confusion can arise. 

The chromatic symmetric function of a graph provides much more combinatorial 
information when that graph is a tree. In general, the coefficient c\ does not count 
edge sets of type A, because (—1)# A is not constant for all such sets A. On the 
other hand, if T = (V, E) is a tree of order n, then every A C E is acyclic, so its 
induced subgraph has n — #A connected components. Hence £(type(A)) = n — j^A, 
and we obtain a useful combinatorial interpretation for the numbers c\ : 

ca = (-l) n - e ^#{A C E | type(A) = A}. (7) 
The invariants c\ are far from independent; in particular, ([7]) implies that 

£ (-l)"-^) C A(T)=( n fc 1 ). (8) 

A: £(\) = k ^ ' 

We next list some basic invariants of graphs that can be recovered from its 
chromatic symmetric function. Several of these facts were previously observed 
by the second author in [12j . For notational simplicity, we shall often omit the 
parentheses and singleton parts of a partition, for instance, writing Cj rather than 

C (i,l,l, ..,!)■ 

Proposition 2. For every graph G = (V,E), 

(i) the symmetric function Xg is homogeneous of degree #V ; 

(ii) -c 2 (G) = #£; and 

(iii) the number k of connected components of G is min{^(A) | c\(G) 0}. 
If G is a tree, then in addition 

(iv) for k>2, \cj(G)\ is the number of subtrees of G with j vertices; and 
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(v) c„_i(G) is the number of leaves of G. 

Proof. Assertion (i) is immediate from the definition of Xg, and (ii) follows easily 
from ([5J. For (hi), implies that c\ = whenever £(X) < k. On the other 
hand, c A = Ea(-!) #A = (-l) 71- *^ 1 ) 0), the sum over all A C E(G) with 
\{A) = X(E(G)), where t G is the Tutte polynomial of G (see pfl Ch. X]). Up to 
sign, this formula counts acyclic orientations of G with exactly one sink in each 
component (this follows from [IJ Thm. 8, p. 348]). In particular, c\ ^ 0. 

Assertion (iv) holds because A C E(G) is the set of edges of a j-vertex subtree 
if and only if type(A) = (j, 1, . . . , 1), and (v) follows because every subtree of order 
n — 1 is of the form G — v , where v is a leaf. □ 

By (i), (ii) and (hi) of Proposition[2j trees can be distinguished from non-trees by 
their chromatic symmetric functions. Moreover, part (v) implies that paths (trees 
with exactly two leaves) and stars (trees with exactly one non-leaf) are determined 
up to isomorphism by their chromatic symmetric functions. 

The girth of a graph G is defined as the length of the smallest cycle in G, or oo 
if G is acyclic. With a little more work, we can compute the girth of G from Xg. 
The idea is to hnd the smallest edge set for which ([8|) fails. 

Proposition 3. Let G = {V,E) be a graph with n vertices and m edges. Let k be 
the largest number such that ^2 xhn e ( X j =k c\(G) ^ ( — Then the girth 
of G is n — k + 1 . 

Proof. Let g be the girth of G. Suppose first that k > n—g + 1. Then n — k < g — 1, 
so every subset A C E with n — k edges is acyclic and hence has k connected 
components. On the other hand, if £(type(A)) — k, then the maximum size of a 
component of A is n — (k — 1) < g, so A must be acyclic and hence must have n — k 
edges. Therefore 

E ^(G) = £ {-i)* A = J2 = J 

Ahra ACE ACE V ' 

i{X)=k f(type(A))=fc #A=n-fc 

Now suppose that k = n — g + 1. We claim that ACE has k components if 
and only if it either has n — k edges (hence is acyclic) or is precisely a cycle of 
length g. The "if" direction is evident. For the "only if" direction, suppose that 
A has k components and is not acyclic, hence contains a cycle C. By definition 
of G, the length of G cannot be less than g; on the other hand, there are at least 
k — 1 vertices that do not belong to G (one for each other component of A), so 
#V(G) < n — (k — 1) = g. Thus G has length exactly g. Moreover, A — C cannot 
contain any other edge with an endpoint outside G (because then it would have 
fewer than k components) or an edge joining two vertices of G (because then G 
would contain a cycle of length < g). Hence A = C as desired. Denoting by F the 
set of ^-cycles of G, we have 

£ c x {G) = £ (-!)#*+ 

Ahn ACE ACT 

l(\)=k #A=n-k 

= ( _ 1)B _ fc ^m^ +( _ 1)n _ fc+1#r ^ 
as desired. □ 
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2. Proof of the main theorem 

Theorem [1] expresses the subtree polynomial St and connector polynomial Kr 
of a tree T in terms of the chromatic symmetric function Xj. The first step is to 
show that St and Kt are interchangeable. In what follows, we will often abuse 
notation by ignoring the distinction between a tree T = (V, E) and its edge set E. 

Proposition 4. Let T be a tree. Then: 

(1) S T («,r) = K T (qr,q(l-r)). 

(2) K T (x,y) = S T {x + y,x/(x + y)). 

Proof. For each nontrivial subtree S C T, write L(S) for the set of leaf edges of 
S. Note that #L(S) > 1, with equality if and only if S consists of a single edge. 
Moreover, observe that A U K(A) = S if and only if L(S) C A C 5. Hence 

K r (gr,<z(l-r)) - £ (?r) #A (g(l - r))W) 

ACT 

= X! ? #(AU ' f!r(A3 V #A (l - r) #K(j4) 

ACT 

E « #S E r# A (l-r)# s -# A 

subtrees SCT A: 

i(S')CAC5 

= ^g #S r #L(s) r #G (l - r ) #(s ~ L(s)) ~ #G 

S GCS-L(S) 
= ^ (z #S r ,#L(5) (r , + (1 _ r)) #(S-L(S)) = ST((?;r)j 

s 

giving the first equality. Meanwhile, solving the equations x — qr,y = q(l — r) for 
q and r yields q = x + y, r = x/(x + y), giving the second equality. □ 

We now prove the main theorem. To do so, we establish a formula for the 
connector polynomial of a tree in terms of its chromatic symmetric function, then 
apply Proposition [5] to obtain a formula for the subtree polynomial. 

Proof of Theorem^ By definition, the coefficient of x a y b in Kt(x,?/) is 
#{A CT|#i = a, #K{A) = b] = J2 1 

ACT 
#A=a, #K(A)=b 

= M) b E E E M) #c < 

ACT BCT-A CCS 
#A=a #B=b K(A)CC 

because the innermost sum vanishes unless B — K(A), when it is (— l) b . Setting 
D = B — C ', we may rewrite this expression as 



E E E 

ACT CCT-A DCT-A-C 
#A=a K(A)<ZC #D=b-#C 
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and setting F — C U A yields 

(-!)" E E E (-i) #F_a 

ACT FCT DCT — F 

#A=a AUK(A)CF #D=a+b-#F 

= M)- +i E e (*Z:# F y-^ F 

FCT A: #A=a, v ^ 7 

AUF;(A)CF 

= (-d-i: e e <-D* F 

Ahn FCT v 7 A: #A=o, 

type(F)=A AuAT(A)CF 

= (-D- +l E ( <(A) ^ ^ 1 + „) E (9) 

Ahn V W 7 FCT 

typc(F)=A 

where a(F) = #{A \#A = a, AUK(A) C F}. The set AUif(i) is connected, so if 
it is a subset of F then it must be a subset of some component of F. On the other 
hand, if F 1 is a (possibly trivial) component of F and A C F', then AU-fT(A) C F', 
because F' is a tree containing A and A U is the unique minimal such tree. 

Thus if type(F) = A then 

l(X) /A 1\ 
k=l ^ 7 

Note that this formula is valid only if a > 0. Substituting (fT0| into ([9]), we obtain 

H' 4 'E( (1 ;T t : t( )(-T™ e e^- 1 

Ahn V V ; 7 FCT fc=l 

typo(F)=A 

= 5>(A,a,6)c A (T) 

Ahn 

(where tp(X,a,b) is defined by JT])), giving the desired formula Q. 
We now turn to the proof of ((31) ■ By Proposition [|J we have 



Sr(?> r ) = K T(gr,g(l - r)) 

a>0b>0 Ahn V V > 7 fc=l V 

Setting i = a + 6, we may rewrite the last expression as 

EB-w.--rE(^"„; j )E( A '; 1 ]«m. 

i>0 o=l Ahn v v 7 7 fc=l v 
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Applying the binomial expansion to (1 — r) 1 a yields 

ED-w-E Cr)(-^ k E i™.- 1 ^ E (\" >cn. 

i>0 a=l h=0 V 7 Ahn v v y 7 k=l v 7 

Now setting h = j — a gives 

£ E-dy E G I :) (-)-v v (^_; \ ,) £ (\" 

i>0 o=l j=a VJ 7 Ahn V W 7 fc=l V 7 

and setting a — d and rearranging gives 

ee« v e (<-H-^£ ( MJ-3£(V)H 

i>0 j=l Ahn V V 7 d=l VJ 7 fe=l V 7 / 

£(A)=£ 

= EE^' E 0(A,i,j)cA(T) 

i>0 3=1 Ahn 
£(A)=^ 

which is the desired formula (|3|). □ 

Two basic invariants of a tree are its path sequence and its degree sequence. The 
path sequence of T is defined as (711, 7T2, . . . ), where tt, — 7rj(T) is the number 
of i-edge paths in T. The degree sequence of T is defined as (Si, 82, ■ ■ ■ ), where 
5j = 8j(T) is the number of degree- j vertices in T. Knowing the degree sequence 
is equivalent to knowing the star sequence (a± , a% , . . . ) , where o~f. = o~k (T) is the 
number of fc-edge stars in T. Indeed, it is not hard to see that 



**=£(;)«■ 

j>k 



for every 2 < k < n — 1, and so 

k>3 



Corollary 5. TTie degree and path sequences of a tree T can be recovered from its 
chromatic symmetric function. 

Proof. The key observation, due to Chaudhary and Gordon [3l Proposition 18], is 
that the path and star sequences of T can be recovered from St- Indeed, 7Ti is 
the number of edges of T, and for every i > 2, 7T; is just the coefficient of gV 2 in 
St(<Zj?*). Meanwhile, for every k > 1, er^ is the coefficient of q k r k . □ 

We note that Fougere had proved [6l Theorem 3.3.1] that the sum of the squared 
vertex degrees, ^jSjj 2 , could be obtained from the coefficient of the monomial 
symmetric function tomi 1 .) in Xy. 

We can rephrase the formulas for Ky and St in terms of the usual scalar product 
(•, •} on the space A„ of degree-n symmetric functions (see [HI §7.9] or [lOl §1-4]), 
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where n is the order of T. Define symmetric functions ^> n (x,y) and < & n (g,r) by 

*n(x, V) = E E E ^ A > «' ^ 

a>0 6>0 Ahn ZX 

n — 1 i 

i=l J — 1 Ahn 

Then the formulas 1(3} and ([3]) are respectively equivalent to 

K T (x,J/) = X T ), 

S T (g,r) = <$„(?, r), X T >. 

The symmetric function \& n appears to have certain positivity and integrality 
properties, as we now explain. The i th homogeneous symmetric function hi in 
indeterminates x\, X2, ■ ■ ■ is the sum of all monomials of degree i, and for a partition 
A = (Ai, . . . , Xe) we define h\ — h\ t ■ ■ ■ h\ t . The symmetric functions {h\ | A h n} 
form a vector space basis for A„ TU] , so there is a unique list of rational numbers 
£(A, G Q such that 

i,j Xhn 

Conjecture 6 (Positivity). Let /i \~ n be a partition, and let e(/Lt) be the number of 
parts of [i of even length. Then, for all integers (— l) e W£(/x, > 0. 

Conjecture 7 (z-Integrality). Let /ih n be a partition. Then, for all integers 
the number i, j)z^ is an integer. 

We have verified Conjectures [6] and [7] computationally for all n < 20, which 
we think is strong evidence that they hold for all n. A formula for £(fi,i,j) can 
be written out explicitly using the known transition matrices between bases of 
symmetric functions (see [S])- However, we do not know a direct combinatorial 
interpretation for or for ^(n,i,j)z ll . 

One might hope that for every two trees T,U with #V(T) > #V(U) > 1, 
the number of subtrees of T isomorphic to U might be given by a scalar product 
(Cu> Xt), where £f7 is some symmetric function independent of T. Such a result 
would generalize Corollary [5] (which covers only the case that U is a path or a 
star) and, by a theorem of Harary and Palmer [9 J , would imply that every tree is 
distinguished by its chromatic symmetric function. In fact, it appears that such a 
function £y exists only if U is a star or a path, as we have verified computationally 
for all U of order < 8, with one trivial exception^ 

Theorem Q] does not resolve Stanley's question, because St is not a complete 
isomorphism invariant. Indeed, the two trees Ti,T% shown in Figure [21 share the 
same subtree polynomial; this is a special case of a theorem of Eisenstat and Gor- 
don [4]. On the other hand, 7^ Xt 2 - This inequality follows from Tan's 
calculations [TB], and also for the following elementary reason. Let A C E(T\) be 
the edge set obtained by deleting the two rightmost horizontal edges in Figure 



A Maple worksheet containing the calculations is available at 
http : //math . ku . edu/~ jmart in/sour cecode/ 

z Up to isomorphism, there are three four-edge trees: the star S4, the path P4, and another 
tree U . Since the number of four-edge subtrees of T is just C5(T), we have Cu = c 5 ~ CS4 ~ Cp<i- 
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then type(A) = (7,2,2). On the other hand, no subset of -E(T 2 ) has that type. 
Therefore c (7 , 2i2) (Ti) ^ and c (7)2i2) (T 2 ) = 0. 

The remainder of the article is devoted to identifying special classes of trees T 
for which the invariants c\ suffice to reconstruct T up to isomorphism. 

3. The Chromatic Symmetric Function Distinguishes Spiders 

A tree is a spider (or starlike tree) if exactly one of its vertices has degree > 3. 
By CorollaryO whether or not a tree is a spider can be determined from its subtree 
polynomial. A spider may equivalently be defined as a collection of edge-disjoint 
paths (the legs) joined at a common endpoint t (the torso). 

Up to isomorphism, every spider on n vertices can be described by a partition 
A h n — 1 whose parts are the lengths of its legs (so £(A) > 3). We denote the 
corresponding spider by T\. Note that £(X) equals both the number of leaves of T\ 
and the degree of its torso. 

We will show that the isomorphism type of a spider can be determined from 
its subtree polynomial, hence from its chromatic symmetric function. Fougere [6l 
Chapter 2] had previously shown that forks (spiders with exactly one leg of length 
> 1 and extended stars (spiders in which every leg has length k or k + 1 for some 
k) could be reconstructed from their chromatic symmetric functions. 

Before continuing, we describe a combinatorial problem whose solution will play 
a role in the proof. Let mi , . . . , rrik be nonnegative integers with ^ TOj = t. Suppose 
that we have £ distinguishable boxes, of which TOj have capacity % for each i 6 [fc]. 
Let n(mi, . . . , mfe) be the number of ways of distributing k indistinguishable balls 
among these boxes so that no box is filled beyond its capacity. In general, it is 
not easy to write down a closed formula for Q (mi, . . . , rrife), although individual 
instances can be computed using an inclusion-exclusion argument (for example). 

Theorem 8. Let Ahn-1 be a partition with £ — £(X) > 3, and let T = T\ be the 
corresponding spider. Then T can be reconstructed from its subtree polynomial. 

Proof. First, suppose that i — 3. Then Ai + A 2 , Ai + A 2 + A3, and AiA 2 A3 are 
respectively the diameter, number of edges, and number of three-leaf subtrees of 
T\. These invariants can be recovered from Sa, and together they determine A. 

Now suppose that £ > 3. Let nik denote the number of parts of size k in A, and 
let s(i,j) denote the number of subtrees of the spider T with i edges and j leaf 
edges (that is, the coefficient of q % r^ in Sr(q,r)). We will show by induction on k 
that mfc can be calculated from the numbers s(i,j). 

First, suppose k = 1. Since there is a bijection between legs of T of length 1 
(i.e., consisting of a single edge) and subtrees with n — 2 edges and I— 1 legs (which 
are formed by deleting such an edge). Hence mi = s(n — 2, £ — 1). 

For k > 1 , we can choose a subtree S C T with n — 1 — k edges and £ — 1 leaves as 
follows. First, fix j G [k] and delete a leg of length j; there are nij ways to do this. 
If j < k, then we still need to delete k — j more edges. It suffices to specify how 
many edges to delete from the end of each of the other £ — 1 legs, so the number 
of ways to do this is the solution to the balls-in-boxes problem described above, 
regarding the k — j edges to be deleted as balls and each remaining leg of length 
j as a box of capacity min(i — 1, k — j) (since deleting the entire leg will result in 
a tree with fewer than £ — 1 leaves). Therefore s(n — k — 1, £ — 1) is given by the 
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formula 

L/c/2] 

TOfc + m 3^( m 2, ■ ■ ■ ,mj_i,mj - l,m i+ i, . . . ,m k -j,l- {m x H h m k -j)) 

3 = 1 

k-1 

+ mjVL(m 2 , ■ ■ ■ , TOfe-j, £ - 1 - (mi H hm(,_j)). (11) 

3=[_k/2\+l 

By induction, the s(i,j) determine mi, . . . ,Wfe_i, and equation pip implies that 
they determine m k as well. □ 

Another way of reconstructing a spider from its chromatic symmetric function 
will be useful in Section [5J Define the two-part portion Xg of Xq by 

X G = ]T c Aj3A . (12) 

f(A)=2 

While Xg is evidently a much weaker invariant than Xg, it contains enough infor- 
mation to distinguish among spiders. 

Theorem 9. Let T be a spider of order n. Let m = |_§J, let e = n — 2m, let 
d-a = \c a ,n-a{T)\ for 1 < a < m, and let d(T) — (di, ■ ■ ■ ,d m ). Then one of the 
following conditions holds: 

(1) The sequence d(T) is a partition, that is, d\ > d 2 > • • • > d m > 0. 

(2) There is a number t < m such that d\ > ■ ■ ■ > d t — 1. Moreover, dt+x — 
d t +2 = ■ ■ ■ = dm-i = 2, and d m E {1, 2}. 

In the first case, let fi = d(T). In the second case, define a partition fi from d(T) 
by replacing all 2's after the t th place with two Vs. 

Then T = T\, where A is the conjugate partition to \x. 

Proof. For each edge e 6 E{T), denote by ||e|| the minimum of the orders of the com- 
ponents of T— e, so that d a = ff{e\ ||e|| = a}. If we label the legs of T as L\, . . . , L(, 
where Li has Xi edges, and label the edges of each Li as e^i, ej,2, ■ ■ ■ , ei,A i5 starting 
from the leaf and reading in toward the torso, then \\eij\\ = min(j, n — j). 
Case 1: Ai < n — Ai. Then j < n — j for every edge eij. In this case d a = 
ff{i | Xi > a}, and d(T) is just the conjugate partition of A. 

Case 2: Ai > n — X\. If i ^ 1, then A, < (A2 + ■ ■ • + Afc) — 1 = n — Ai — 2, and 
ll e ij ll = j f° r every j. So we can give a formula for d(T) in terms of A: 

I Xi > a} if 1 < a < n - X x - 1, 
2 if n- Ai < a < m- 1, (13) 

1 + e if a = m. 

Note that d n _ Al _i = 1, because e\ n _ Al _x is the unique edge whose deletion 
contains a component of order n — X\ — 1. On the other hand, n — X\ — 2m+e — X\ < 
2m + e — (m + 1) = m + e — 1, so d n _\ 1 = 2 whether n is odd or even. Therefore, 
d(T) is not a partition, but has the form described in case (2) of the theorem. 
Formula (fl"3f implies that if we "iron out" d(T) by replacing every 2 after the 
(n — Xi) th entry with two l's, we will obtain the conjugate partition to A. 

We now see how to recover A, and thus the isomorphism type of the spider T\, 
from the data d(T\). Either d(T\) is a partition, in which case it is the conjugate 
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of A, or it has the form just described, in which case the "ironing-out" operation 
yields the conjugate of A. This is precisely the statement of the theorem. □ 

4. Chromatic Symmetric Functions of Some Caterpillars 

A caterpillar is a tree T with the property that the induced subgraph on the 
non-leaf vertices is a nontrivial path, called the spine of T. That is, every vertex 
of T cither lies on the spine, or is a leaf whose unique neighbor lies on the spine. 
Since the spine is nontrivial, a caterpillar must have at least four vertices. 

While Eisenstat and Gordon's result in [4] rules out the possibility of distinguish- 
ing caterpillars by their subtree polynomials, there is still reason to hope that the 
additional information provided by the chromatic symmetric function of a caterpil- 
lar may suffice to reconstruct it up to isomorphism. 

Our first result is that the chromatic symmetric function distinguishes caterpil- 
lars from non-caterpillars. The number of leaves and the diameter of a tree T (the 
maximum length of a path in T) can be recovered from Xr by Corollary [5l so it 
suffices to prove the following fact. 

Proposition 10. Let T be a tree with n > 4 vertices. Then T is a caterpillar if 
and only i/diam(T) — 1 = n — Si(T). 

Proof. If T is a caterpillar, then every path of maximum length consists of the spine 
together with a leaf attached to each of its endpoints, hence contains all the non-leaf 
vertices and two other (leaf) vertices. In particular, the number of edges in such a 
path is one more than the number of non-leaf vertices. On the other hand, if P is 
a path of maximum length in T, then the internal vertices of P are not leaves of T 
(because each has two neighbors in P) but its endpoints are (otherwise P could be 
lengthened). If diam(T) — 1 = n — S%(T), then all the vertices not lying on P must 
be leaves, which is to say that T is a caterpillar. □ 

Let T be a caterpillar with spine vertices vq, . . . , v s . For each i, let = deg(«i) — 
1, where deg(vi) denotes the degree of the vertex Uj. Gordon and McDonnell [8j 
Lemma 2] showed that the numbers are almost determined by the path sequence 
of T, and are indeed determined by the path sequence when the caterpillar is 
symmetric (that is, ei = e s -i for < i < s). Therefore, every symmetric caterpillar 
is determined up to isomorphism by its chromatic symmetric function, a result 
proved in another way by the second author [12l Theorem 4.3.1]. 

We now describe another class of caterpillars that can be reconstructed from 
their chromatic symmetric functions. We retain the labeling of the vertices of T as 
vq, . . . ,v s . Let fi be the number of leaves adjacent to Uj, so that /, = deg(vi) — 1 
for i = 0, s and fi = deg(i>i) — 2 for < i < s. Call /j the i th leaf number of T. In 
addition, call a partition A singleton-free if all its parts are at least 2. 

Theorem 11. Let T be a caterpillar whose leaf numbers fi are strictly positive and 
distinct. Then T can be reconstructed from its chromatic symmetric function. 

Proof. Let L be the set of leaf edges of T. Since fi>0 for all i, every spine vertex 
is adjacent to at least one leaf. Therefore, the edge sets ACT such that type(A) is 
singleton-free are precisely those that contain L. In particular, A = type(L) is the 
unique singleton- free partition with s + 1 parts whose coefficient c\(T) is non-zero. 
Up to reordering, the parts of A are the numbers fo + 1, • • ■ , f s + L Furthermore, 
for each spine edge = Vi-iVi, the edge set LU{ei} contributes (— 1)™~ S to c Pi (T), 
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where m = {/ + 1, . . . , fi-i + 1, ft-i + /; + 2, f i+1 + 1, . . . , f s + 1}. Note that 
the partitions fj,i are all distinct. Moreover, fii has s — 1 of its parts in common 
with A; the remaining two parts of A must be the leaf numbers of adjacent spine 
vertices of T. (This statement is valid even if the parts of /ii are not all distinct.) 
In this way we can recover the leaf numbers of the endpoints of every edge of T, 
and this data specifies the caterpillar up to isomorphism. □ 

Using the same argument, we can relax the condition of the theorem slightly: 
we need only require that all leaf numbers are positive and that for each k, the set 
of spine vertices with leaf number k form a subpath of the spine. 

5. Unicyclic Graphs: Squids and Crabs 

Despite the title of this article, we devote the last section to a family of graphs 
G that are not trees, but rather unicyclic] that is, G contains a unique cycle. 
Equivalently, c = n — e + 1, where c, n, e are the numbers of components, vertices 
and edges respectively, so unicyclicity can be detected from Xq by Proposition [2] 
While we can no longer interpret the coefficients c\ as in ([7]), we can use Stanley's 
expansion of in terms of broken circuits [HI Thm. 2.9]. A special case of that 
result is the following: if G has a unique cycle C and eo is an edge of C, then 

X G = J2 (- 1 ) #A ftype(^)- (14) 
ACB(G) 
C-eogA 

We do not know whether there exist two unicyclic graphs with the same chro- 
matic symmetric functions. 

A squid is a connected unicyclic graph with a single vertex v of degree greater 
than 2. Note that v must lie on the cycle. A squid is described up to isomorphism 
by the length of its cycle and of the tentacles (the paths from the leaf vertices to v). 
For example, if T\ is the spider whose leg lengths are given by the parts of A, then 
adding an edge between the leaves at the ends of the two longest legs produces a 
squid with cycle length Ai + A2 and tentacle lengths A3, A4, . . . . 

It is not clear how to determine from Xg whether or not a unicyclic graph G is 
a squid (for instance, we cannot recover the degree sequence of an arbitrary graph 
from its chromatic symmetric function as we can for a tree). Nevertheless, the 
following uniqueness result does hold. 

Theorem 12. No two non-isomorphic squids have the same chromatic symmetric 
function. 

Proof. Let S be a squid with unique cycle C. Let k + 1 be the length of C; this 
number can be recovered from X5 by Proposition [3] Let v be the unique vertex 
of S of degree > 2, and let /i be the partition whose parts are the edge lengths of 
the tentacles. Label the edges of C as eo, e\, . . . , e^, starting at v and proceeding 
around the cycle. By (|14[) and inclusion-exclusion, we obtain 




i=l \ ei gA 
k 

= E"^ S '- ei_ E ^S-s.-e, 

i=l l<i<j<k 
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where the omitted terms involve edge sets A lacking three or more edges from C— bq. 
Deleting three or more edges from S produces a graph with three or more connected 
components, so passing to two-part portions as in (|12| yields Xg = 53i=i ^s- ei — 
J2i<i<j<k ^-S-ei-ej - Each graph S — e, ; — ej has exactly two components, of sizes 
j — i and n — j + i. Removing additional edges will strictly increase the number of 
components, so 

fc 

i=l \<i<j<k 

where T^.i^-i = S — is the spider with legs whose lengths are i, k — i, and the 
parts of /i. Therefore, X5 determines the quantity 

fe 

^X w _ i =X s + (-^PU^n-i+t)- (15) 

i=l l<i<j<k 

Leaving the foregoing calculations aside for the moment, we note that if T is 
a tree with n — 1 edges, then to calculate X^ we need only consider the edge 
subsets of cardinality n — 2. In particular, if T = T\ is a spider, then X^ = 
(— l)™ -2 ^2jLiP(j,n-j)- It follows that for any partition fi and numbers k,i, 

we have 

(k i k—i 

3=1 3=1 3=1 





which can be computed from X5 using (|15[) . Meanwhile, by Theorem [9l we can re- 
construct the spider T^j, from Xt m k ■ In particular, we can reconstruct the partition 
fj,, which gives the tentacle lengths of the squid 5. □ 

Just as squids can be regarded as the unicyclic analogues of spiders, the unicyclic 
analogues of caterpillars are crabs: connected unicyclic graphs in which every vertex 
not lying on the cycle is a leaf. In analogy to Theorem [T^] and its proof, we 
ask whether is it possible to use the results of Section 0] to classify the chromatic 
symmetric functions of (some) crabs. As a starting point, we prove the following 
analogue of Theorem [TT1 

Proposition 13. Let G be a crab such that the degrees of its non-leaf vertices are 
all distinct and all greater than 2. Then, if H is another crab with this property, 
and Xg = X# , then G = H . 

Proof. We will show that the coefficients of Xg, together with the knowledge that 
G is a crab with the property just mentioned, determine G up to isomorphism. 

The girth g of G can be recovered from Xg by Proposition^ Let C be the unique 
cycle of G, and label its vertices in cyclic order as v\, . . . , v g . Let /, = deg(ui) — 2 be 
the number of leaves adjacent to u,. Note that G can be specified up to isomorphism 
by the cyclically ordered list of numbers /1, . . . , f g . 

Let L denote the set of leaf edges of G. The subsets of E(G) whose type is 
singleton-free are precisely those that contain L as a subset. In particular, L itself 
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is the unique edge set whose type is a singleton-free partition of length g. Thus 
type(L) = {/i — 1, /2 — 1, . . . , f g — 1} can be recovered from Xg- Moreover, there 
are precisely g edge sets whose type is a singleton- free partition of length g — 1; 
these edge sets are of the form L U {e} for some e G C. Just as in Theorem [TY] 
the types of these edge sets are all distinct, and they specify which pairs of the fi 
correspond to adjacent vertices of the cycle. □ 
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